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Abstract. We give a new proof of a polynomial recurrence result due to 
Bergelson, Furstenberg, and McCutcheon, using idempotent ultrafilters instead 
of IP-limits. 



Introduction 

In the thirty or so years since H. Furstenberg reproved Szemeredi's theorem using 
methods from ergodic theory, many striking discoveries have been made in the area 
now known as Ergodic Ramsey theory. Perhaps the most surprising of these is the 
discovery that recurrence results can be obtained for polynomial sets, meaning sets 
of values of polynomials. The following pretty theorem, a special case of a more 
general theorem proved by V. Bergelson, H. Furstenberg, and R. McCutcheon in 
P], is a typical result in this direction. 

Theorem. Let T be the collection of all non-empty finite subsets of N. For 
any polynomial p S Z[xi, . . . , xu] satisfying p(0, . . . , 0) = 0, and for any IP-sets 
q£ jf, the set 

R={p(n£\...,nW)\aef} 

is a set of nice recurrence. 

To say that R C Z is a set of nice recurrence means that for any probability 
space (X, B, fi), and any invertible measure-preserving transformation T on X, one 
has 

limsup^(AnT"A) > ^{Af 
for all A £ B. Moreover, an IP-set is any set of the form 

|n Q = Yn l \aeTy 

for positive integers no, ni, rz.2, 

As in Furstenberg's result, this inequality has immediate combinatorial appli- 
cations. It also turned out that the above theorem was only a first step; much 
stronger results — combining IP-convergence, multiple recurrence as in Szemeredi's 
theorem, and polynomial sets — have since been established, for instance in [2]. 

The purpose of the present paper is to give a different proof for the central result 
of PQ, using idempotent ultrafilters instead of IP-limits. While this approach is less 
constructive, it has the advantage of "making the statements and proofs cleaner 



2000 Mathematics Subject Classification. 37A45; 28D05; 54D80. 

Key words and phrases. Idempotent ultrafilter, IP-set, Polynomial recurrence, Measure- 
preserving dynamical system. 

1 



2 



C. SCHNELL 



and more algebraic," in the words of the survey paper [3]. It also follows the general 
philosophy that for each result about IP-sets, there should be an analogous result 
about idempotent ultrafilters. 

The main theorem and its proof are presented in Section [5J however, a better 
point to begin reading is probably Section[2j which treats a special but typical case, 
and explains the method of proof in some detail. Section [6] contains a small number 
of applications, of the type mentioned above. 

Since ultrafilters on groups and semigroups are used throughout the paper, their 
basic properties are reviewed in Section [TJ readers who are already familiar with 
/3N, for instance from [3], will recognize all the material, despite the more general 
context. To keep the paper self-contained, several generally known results about 
operators and integer-valued polynomials have also been included; these make up 
Sections [3] and |U 

Note. Vitaly Bergelson, who advised me during my first two years in graduate 
school, suggested the problem of reproving the results in [1] using idempotent ul- 
trafilters. I am very grateful to him for his help, as well as for countless pleasant 
conversations. Unlike wine, the paper has failed to mature during the several years 
that it has been stored on the hard drive of my computer; nevertheless, I have de- 
cided to make it available, since it is in my opinion a nice application of idempotent 
ultrafilters to recurrence results. 

1. The Stone-Cech compactification of a discrete semigroup 

Ultrafilters. We begin by reviewing the definition and several basic properties of 
the space of ultrafilters. Let (S, o) be a commutative semigroup. An ultrafilter on 
S is a collection p of subsets of S with the following four properties: 

(1) S £p and g> 

(2) If A G p, and B D A, then B G p. 

(3) If A, B G p, then A n B G p. 

(4) For every ACS, either A G p, or S \ A G p. 

For every s G S, there is a principal or trivial ultrafilter consisting of all subsets 
containing s; the construction of other ultrafilters requires the Axiom of Choice. 

The space (3S of all ultrafilters on S, suitably topologized, is the Stone-Cech 
compactification of the discrete space S. After briefly stating the basic properties 
of f3S, we will consider two examples: one where S is the group Z™, and a second 
one where S equals T, the set of nonempty finite subsets of N. A good and very 
comprehensive reference for this topic is the book by Hindman and Strauss [?]. 

Terminology. Since ultrafilters are collections of sets, the following terminology 
is convenient when dealing with their members. If p is an ultrafilter on S, we call 
a set p-big if it is contained in p; we shall also use the phrase '/or p-many s' to 
mean 'for all s in some p-big set.' In the case of several variables, we shall say that 
(statement) holds '/or p-many s\, . . . , s„' if 

{si G S | {s 2 G S | • • • {s n e S | (statement)} €p---} G p} G p. 

In other words, there should be p-many si, for which there are p-many S2, for which 
. . . , for which there are p-many s n , for which (statement) is true. Nested sets of 
exactly this form will play a role during the proof of the main theorem in Section [5] 
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Basic properties. As was said above, we let j3S be the set of ultrafilters on S, 
and consider S as a subset of [3S, by identifying an element of S with the principal 
ultrafilter it generates. One can put a topology on (3S, in which the sets 

A = {p G PS | A G p} (for A C S) 

give a basis for the closed sets; each A is both closed and open. The result is a 
compact space (this includes the Hausdorff property) that has S as a discrete and 
dense subspace. 

The semigroup operation o extends to 0S; given p and q in (3S, their product 
p o q may be defined by the property that for any ACS, 

Aepoq {s e S\{t e S\s ot e A} e q} e p. 

The new operation is associative and continuous from the left (meaning that for any 
q, the map p i— > p o q is continuous), and makes (3S into a compact left-topological 
semigroup. 

Idempotent ultrafilters. An ultrafilter p G (3S is called idempotent if it satisfies 
the relation p o p = p. Idempotent ultrafilters are closely related to IP-sets, which 
are sets of the form 

{IT Si | a *~ ^ nm te, nonempty | , 

for a given sequence (si)^. Any member of an idempotent ultrafilter contains 
an IP-set, and conversely, every IP-set is contained in some idempotent ultrafilter. 
This fact is sometimes called Hindman's theorem (see [3J Theorem 3.4] for details); 
it implies that one can find many idempotent ultrafilters (provided, as usual, that 
the Axiom of Choice is assumed). 

When a finite sequence s\,...,s n is used in place of an infinite one, we shall 
denote the resulting finite IP-set by IP(si, . . . , s n ). The proof that any member of 
an idempotent has to contain an IP-set allows a much stronger conclusion if we are 
only looking for finite IP-sets. 

Lemma 1. Let p G f3S be an idempotent ultrafilter. If A is a p-big set, then for 
any neN one has 

{si g A \ •••{s„ eA\ IP(si,...,s„) cA}ep---}ep. 

In the terminology introduced above, one can say that there are p-many si, . . . , s n 
in A such that IP(si, . . . , s n j C A. 

Limits along ultrafilters. Another useful notion is that of a p-limit, or a limit 
along some ultrafilter. Let p G (3S be an ultrafilter. Given a map f:S—>Y into 
some topological space Y, we say that a point y is a limit of / along p, written 

V =P-lim/(s), 

if for every neighborhood U of y, the set / _1 (J7) is p-big. When the target space 
Y is compact, all p-limits exist and are unique. 

This notion of limit is related to the Stone-Cech compactification in the following 
manner. A compactification of a Hausdorff space X is a compact space containing 
A as a dense subspace. The Stone-Cech compactification (3X is the universal com- 
pactification, in the sense that for any compact space Y and any continuous map 
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/: X — ► Y, there is one and only one continuous extension /* from j3X to Y, as 
illustrated in the diagram. 

X C >(3X 

/ 

/ 

/u 



Every other compactification is a quotient of (3X: furthermore, if g: Y — ► Z is 
a second continuous map of compact spaces, one has (fg)* = f*g* because of the 
uniqueness statement. 

Now the space (3S, as defined above, is the Stone-Cech compactification of the 
discrete topological space S; given any map / : S —> Y into a compact space Y, the 
required extension /* : {3S — > Y is given by 

/.(p)=p-lim/(«) (forpe/JS), 

s 

which is continous as a map from /JS to Y. 

The following lemma is an immediate consequence of the universal property. 

Lemma 2. Any map <f>: S — > T between two semigroups S and T induces a con- 
tinuous map 0* : f3S — » /3T, given &y 0*(p) = p-lim s </>(s). ^4 se£ £> is <ft*(p)-big if, 
and only if, its preimage _1 (.B) is p-big. For any map f : T — ► Y into a compact 
space Y, one has 

p-lim/(0(«))=0.(p)-lim/(t). 

s t 

J/ is multiplicative, so is (j>* ; in particular, </>* (p) zs i/ien always idempotent for 
idempotent p £ /JS 1 . 

There is another important property of p-limits, especially useful for our pur- 
poses. 

Lemma 3. Let p and q be two elements of (3S. The equality 
p-limg-lim/(s o t) = (p o q)-limf(s) 

s t s 

holds for any map f : S — > Y into a compact space Y. 
Proof. Let y = p-lim s g-lim t f(s o t); for any neighborhood U of y, the set 

{s | g-lim/(s ot) elf} 

is p-big. Equivalently, 

{s\{t\f(sot)eU} eq} ep, 

and this is nothing but the condition f~ l {U) Gpog. It follows that the right-hand 
limit (p o q)-lim s f(s) also equals y. □ 

The lemma explains one useful aspect of idempotent ultrafilters — if p is an idem- 
potent, one has 

(1.1) p-limp-lim/(s o t) =p-lim/(s), 

s t s 

and this relation is at the base of all applications of ultrafilters to recurrence results. 
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As an application, let us prove a lemma known as van der Corput's trick, for 
p-limits. It provides a useful sufficient condition for a weak p-limit in a Hilbcrt 
space to be zero. 

Lemma 4. Let Y be a closed ball in a Hilbert space 7i, endowed with the weak 
topology (and thus compact). Given a map f : S — > Y and an idempotent p G /3S, 
let y = p-lim s f(s). If 

p4imp-lim(/(soi),/(t)) = 0, 

s t v ' 

then y = 0. 

Proof. One uses (jl.ljl in a clever way. Notice that for any iVeN, 

1 " 

?/ = p-lim • • -p-lim — V" f(s n o • ■ • o s N ). 

si s N JV z — ' 

n=l 

Using weak lower semi-continuity of the norm, we obtain 

i2 



A 2 



,/|| < p-lim- • -p-lim -Ul /(s„ o • • • o s N 



n=l 

JV N 



s N N 2 

= p-lim ■ • • p-lim — ^ ^ ^ /(s m o • • • o s N ), ^ /(s„ o-os^ 

m— 1 n — 1 

= "^72 y^P-lim- • -p-lim(/(s m o • • • o s N ),f(s n o • • • o sjv)), 

iV z * — ' Sl S N 

and after collapsing the multiple p-limits with the help of this becomes 

1 2 

= ]v2 ^P" 1 ™!!^^)!! 2 + Re H P-limp-lim(/(so *),/(*)) 

n m<n 
1 2 

= — p-lim /(s) . 
TV s 11 11 

Since iV was arbitrary, we see that y = 0. □ 

We shall now discuss two concrete examples of semigroups and their Stone-Cech 
compactifications, namely /3Z™ and j3T. 

Abelian groups and /3Z". We are going to use vector notation for elements of 
Z™, such aso = (oi, . . . , a„). Even though Z™ is a group, the space of ultrahlters 
/3Z n is only a semigroup, because there are in general no inverses for elements. Still, 
we can get information about ultrahlters in /3Z™ from the group structure of Z"; in 
particular, we shall investigate the relationship between subgroups and idempotent 
ultrahlters. 

Every subgroup of Z n is itself free, of rank between and n. The first observation 
is that subgroups of rank n are contained in every idempotent ultrahlter. 

Lemma 5. For every idempotent p € /3Z", all rank n subgroups are p-big. 

Proof. A subgroup L of rank n necessarily has hnite index. As an ultrahlter, p thus 
has to contain one of the cosets, say z + L, and as an idempotent, it then has to 
contain the set 

{a e z + L\ {b e z + L\a + b S z + L) Gp} 
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as well. In particular, that set is nonempty. The resulting equation z + z = z 
mod L gives z + L = L, and we can conclude that L itself is p-big. □ 

We now define the dimension of an ultrafilter p, denoted dimp, to be the small- 
est possible rank of a p-big subgroup of Z™ . Since we expect p-big sets to be large 
(especially when p is an idempotent), it would be nice if the dimension of an ultra- 
filter in /3Z™ was always n. This is not true; for instance, the principal ultrafilter 
generated by is idempotent, and has dimension zero. But as the following lemma 
shows, in all such examples, the ultrafilter in question really lives on a smaller 
group. 

Lemma 6. Let p E (3Z n be an ultrafilter, of dimension s G {0, . . . , n}. //GCZ" 
is an arbitrary p-big subgroup of rank s, then there is an injective group homomor- 
phism 4>:7L S — > Z™ with image G, and an s-dimensional ultrafilter q £ (51/ , such 
that p = 4>*{q). If p is idempotent, then any such q is idempotent as well. 

Proof. Let G be a p-big subgroup of rank s in Z™ . Since it is free, it is isomorphic 
to Z s , and so there is an injective group homomorphism <f>: If — > Z" whose image 
is exactly G. Define 

q={ACZ s \cb(A)e P }; 
since G is p-big, it is easily verified that q G /3Z S , and that (q) = p. Now q has 
to have dimension s, for otherwise Z s , and therefore also G, would contain a p-big 
subgroup of smaller rank, contradicting the choice of s. 

Now assume that p is an idempotent ultrafilter. Since 0» is a homomorphism, 
we get 

4>*{qoq) = (p*(q) ° <f>*(q) = p o p = p; 
but as q is obviously uniquely determined by the condition that (q) — p, it follows 
that q o q = q, and so q is idempotent as well. □ 

IP-sets and /3J 7 . Our second example is the Stone-Cech compactification of T , 
the set of finite nonempty subsets of N. For any two such finite sets a and [3, we 
may form their union aU/3; this operation makes T into a commutative semigroup. 
An ultrafilter in this setting is now a set of sets of finite subsets of N; to avoid 
confusion, we shall reserve the letters a,/3, 7 for points of T. We also continue 
to write o for the semigroup operation on j3T. The character of this operation is 
utterly different from addition on /3Z; for instance, any principal ultrafilter is now 
idempotent. 

We are mostly going to look at IP-sets in N and T from the point of view of (3J-. 
Since the semigroup operation on N is addition, an IP-set is now a set of the form 

where (ni)^ is a sequence of positive integers. This can also be considered as a 
map 

n. : T — > N, a. n a = Hi 

that satisfies 

(1.2) n aU (3 = n a + np 

for disjoint a, [3 £ J 7 . It induces a map n„ from [iT to (3N that we would like to 
be structure-preserving, in particular with regard to idempotents, but this cannot 
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be true. The problem is that the map n, fails to be additive because ljl.2)l holds 
for disjoint sets only. One answer is to look at a subclass of ultrafilters in [3J 7 , 
excluding — among other things — the principal ones. 

One quickly sees that in order to make use of (|1.2|) . it has to be possible, when 
choosing f3 inside a member of some ultrafilter, to make it disjoint from a given a. 
To accomplish this, we introduce the following notion. We let 

C n — {a e T | n e a} (for n G N), 

and call an ultrafilter congested if it contains one of the C n , or uncongested if it 
contains none. Certainly, every principal ultrafilter is congested. We also introduce 
the notation a < (3 to express that the maximum of the finite set a is less than the 
minimum of (3. 

Whenever p is an uncongested ultrafilter and a 6 J 7 , the set 

{!3eT\a<l3} 

is obviously p-big, being an intersection of complements of certain C n . This means 
that for any set A 6 p, the set of (3 £ A with a < (3 is still p-big, and so we can 
impose the even stronger condition a < (3 when choosing an element (3 from A. 

Now let us see what the set (3J- un of all uncongested ultrafilters looks like. An 
IP-ring is a special type of IP-set in T\ it consists of an infinite sequence (aj)jgN 
of elements of T satisfying olq < ct\ < ct2 < ■ ■ ■ , together with all possible finite 
unions of these. The notation 

IP(ci!i)ieN 

will be used for such IP-rings. We then have the following result about f3J- un and 
its connection with IP-rings. 

Lemma 7. (3J- un is a closed (hence compact) sub-semigroup of f3T '. Every idem- 
potent p € (3T un has the property that if a set is p-big, it contains an IP-ring. 
Conversely, every IP-ring is a member of some uncongested idempotent. 

Proof. By definition, 

is an intersection of closed sets, hence closed. Let us show that it is also a semigroup. 
If a product p o q is congested, it has to contain C n for some n, and so 

{a | [f3 | n 6 a U (3} E q} e p. 

If C n is not in p, one of the a in the outer set does not contain n, in which case the 
inner g-big set equals C n . Either way, one of the two factors is congested; products 
of uncongested ultrafilters are therefore uncongested. It follows that [3!F un is a 
compact semigroup. 

To verify the second statement — existence of IP-rings in p-big sets for idempotent 
p — the same proof as for Hindman's theorem will work; when choosing elements, 
one simply follows the recipe mentioned above. 

Third, let us show that every IP-ring is contained in some uncongested idem- 
potent (this also proves the existence of uncongested idempotents). Let lP{ai)i^ 
be an IP-ring and set A n = IP(cti)i> n , with obvious meaning. Following the usual 
procedure, we will show that the intersection 

a= p| a; 

nGN 
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is a closed, nonempty subsemigroup of l3T un ; by Ellis' theorem it then has idem- 
potents, and any such idempotent is uncongested and contains our IP-ring. 

A is certainly closed and nonempty (use the finite intersection property of the 
compact space (3J-). To verify that it is a semigroup, we need to check that for 
p, q G A and any n G N, the set A n is a member of p o q. Given a £ A n , there is 
some k > n for which a U (3 G A n for every (3 G . Hence 

{/3|aU/3 G A„} 

is a (?-big set for every a G A n ; as a consequence, we have 

{a | {/3 | a U /3 G A„} £g} ep (for n G N), 

and thus po q e A. Finally, every p G A has to be uncongested, for A n+ i and C n 
are always disjoint. □ 

One conclusion is that uncongested idempotents do exist; more importantly, they 
naturally arise when one is looking at IP-rings in terms of ultrafilters. Indeed, the 
lemma is the exact analogon of Hindman's theorem for the case of IP-rings. 

Another useful property of uncongested ultrafilters is stated in the last lemma 
of this section; it closely follows our thoughts after (|1.2[) . 

Lemma 8. For any IP-set n,: T — * N, the induced map n» : f3T un — > /3N is a ho- 
momorphism of semigroups. In particular, n*(p) is idempotent for each uncongested 
idempotent p G (3T un . 

Proof. Let p, q G j3J 7Un be arbitrary uncongested ultrafilters; we need to show that 

n *(p °q) = ° «-*(<?)• 

For any subset A C N, we have A G n*(p) exactly when {a | n a G ^4} G p; referring 
back to the definition of the operation o on page [31 we then find that 

A<En*(poq) <==^ {a | {/3 | n aU/3 G A} G q} G p, 

while 

A G n*(p) o n*(g) <=^- {a | {/3 | n a + np £ 4} 6 g} £ p. 

But since q is uncongested, these two conditions are actually equivalent. Indeed, 
given a £ J 7 , we have n QU /3 = "a + whenever /3 > a, and so 

{/? | n aU/ 3 e A} r\{f3\(3> a} = {(3\ n aUfj G A and /3 > a} 

= {/5 | n a + up G A and /3 > q} 
= {/? | n a + n p G A] n {/3 | /3 > a). 
Now g always contains the set {/3 | /3 > a}, and so 

{(i | n aU p G A} G <? {/3 | n Q + n,a G A} e q. 

This shows that A G n,(pog) if, and only if, A G (p) o n* (g) , and thus proves 
the lemma. □ 

This result will later allow us to transfer results from /3N or /3Z™ to the space 
(3 T . We shall see applications of this idea in Section [5l after we have proved the 
main theorem. 
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2. An extended example 

In this section, we want to give an in-depth discussion of a special case of the 
main results, Theorem l5.il and Theorcm l5.2l We hope that this will help the reader 
understand the character of the argument — in particular, how the induction used 
in the proof works. We are going to consider the following theorem. 

Theorem 2.1. Let U be an arbitrary unitary operator on a Hilbert space TL, and 
let m,,n t : T — > N be any two IP-sets. If p G (3T is any uncongested idempotent, 
the operator P defined by the weak operator limit 

P = p-limZ7 m «* n « 

a 

is an orthogonal projection. 

The given (weak operator) limit abbreviates the equality 
(Px,y) = p-]im(U m ° n *x,y) 

for all x, y € Ti. 

We are not going to prove this directly, because the presence of the two IP-sets 
is inconvenient, in that it obscures part of the underlying structure. Suppose, for 
example, that the two IP-sets were (more or less) equal; then the essentially two- 
dimensional situation of the theorem would collapse down to a one-dimensional one, 
and surely something in the proof will have to change, too. The problem, in other 
words, is that there appears to be a notion of dimension behind the theorem — but 
it is cumbersome to deal with dimension for IP-sets. 

On the other hand, as shown by Lemma[6j there is a good definition of dimension 
for idempotent ultrafilters in /3Z 2 . Instead of trying to prove Theorem 12.11 in its 
present form, we should pass instead to the group Z 2 , where we can talk about the 
rank of subgroups and the dimension of ultrafilters. 

To this end, define a map <fi: T — > Z 2 by 

4>(a) = (m a ,n a ) (for a e JF). 

Because p is uncongested, the new ultrafilter q — 4>*(p) is an idempotent in (31? by 
Lemma |8] moreover, Lemma [2] changes the limit defining P into 

P = q-YiraU ZlZ2 , 

z 

where the notation z is again used for elements of Z 2 . The following more general 
statement now suggests itself. 

Theorem 2.2. Let U be a unitary operator on a Hilbert space TC. If q 6 /3Z 2 is 
any idempotent, the operator P defined by the weak operator limit 

P = q-\imU ZlZ2 

z 

is an orthogonal projection. 

Even in this special case, a proof seems to require two separate steps. We begin 
by introducing an auxiliary operator 

Q = g-limq-limC/ aiZ2+a2Zl , 

a z 

the polynomial in the exponent arising from the original Z1Z2 as 

(<2i + z 1 )(a 2 + z 2 ) - ziz 2 - a\a 2 . 
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Step 1. For the time being, we are going to assume that Q is a projection operator, 
and use the splitting of the Hilbert space TL = ker Q © im Q it induces to prove 
Theorem 12.21 To show that P is an orthogonal projection, we appeal to Lemma [5] 
P is clearly normal, being a limit of unitary operators, and so all we need to do is 
prove the relation P 2 = P. To help with that, let us also create, for each 5 6 Z 2 , 
the operator 

Q s = q-\imU aiZ2+a2Z1 . 

z 

Since the polynomials involved are linear in z , the reader will prove without much 
effort that each Qg is an orthogonal projection; for example, one can use the identity 
in to show that Q\ — Qs, and then apply Lemma [9] Of course, any two of 
those operators commute, since they are all limits of powers of U. 

The point is that under our assumption on Q, the weak operator limits 

Q = q-\imQ s = gdimq-lim[/ ai22+Q221 

a a z 

are actually strong ones, as we shall see. To prove the identity P 2 — P, let us first 
consider the situation on the space kerQ. If x satisfies Qx = 0, we get 

g-lim| I ^ 1 1 2 = g-liin/Qga;, x) = (Qx,x) = 0, 

a a 

and because we have convergence in the norm, we can apply van der Corput's trick 
to show Px = 0. The condition in Lemma [U 

q-limg-lim(j7 (Ql+2:i)(Q2+22) a;,C/ 2l22 x\ = q-limq-lim(u aiZ2+a2Zl x,U- aia2 x) 

a z \ I a z \ I 

= q-\im(Q s x,U~ aia2 x) =0, 



is satisfied, and we conclude that Px = 0, hence P 2 x = Px. 

Next, let us see what happens if x £ imQ. In this case, Qx — x, and we can 
write 

g-lim g -lim||C/ a2Z2+a22l a; - cell 2 

a z 

= 2\\x\\ 2 - 2-Req-limg-lim(C/ ai22+a22l a;,a;) = 2||x|| 2 - 2(Qx,x) = 0, 
from which it follows that 

(2.1) <j-limg-lim||f/ (Ql+Zl)(a2+22) a; - U aia2 U ZlZ2 x\\ = 0. 

a z 

To obtain P 2 x — Px, we make use of the identity in (jl.ip for double q-limits; 
together with (|2.ip . we obtain 

Px = q-lim[/ 2l22 a; = g-lim(7-lim[/ (ai+2l)(Q2+22) 2; 

z a z 

= q-\imq-\imU aia2 U ZlZ2 x = P 2 x. 



We are therefore able to show that P is an orthogonal projection, provided that 
Q is one. The device of getting strong from weak convergence is frequently useful, 
by the way; it is formalized in Lemma fTOl below. 
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Step 2. So far, we have been able to reduce Theorem 12.21 to the proof of the 
following, simpler result. 

Theorem 2.3. Let U be a unitary operator on a Hilbert space H. For any idem- 
potent q G (31? , the operator 

Q = qA\mq-\\mU aiZ2+a2Zl 

a z 

is an orthogonal projection. 

Just as in Step 1, everything hinges on having a good splitting of the underlying 
Hilbert space H. But which splitting one should use depends on the ultrafilter q, 
more precisely on its dimension — which could be 0, 1, or 2. We will treat these as 
separate cases here; in the proof of the main theorem, we shall of course want a 
unified approach. 

Dimension 0. If q is 0-dimensional, it contains the set {(0,0)}, and since we can 
restrict to a q-big set when taking limits, Q is simply the identity operator. So this 
case is trivial. 

Dimension 1. In case dimq = 1, we can find a subgroup Zc (with c ^ 0) of rank 
one in q. Accordingly, we will use the splitting H = Hi © H^, where 

Hi = P| keiQ nS and H{ = ^ im Q nS . 

It is then straightforward to show that Q is orthogonal projection onto 

Indeed, if x is an element of Hi, then Q n gx — holds for all nonzero n, and 
since c generates a <?-big subgroup and {(0,0)} is not <?-big, we get 

Qx = q-lim QfiX — 

a 

On the other hand, to show that Q restricted to Hj~ is the identity, we need only 
consider x £ im Q n s, as the span of these vectors is dense. For any such x, we have 

Q nS x = q-limU n(ciZ2+C2Zl) x = x, 

z 

which, as before, can be strengthened to 

(2.2) qA\™\\U n(ciZ2+C2Zl) x - x\\ = 0. 

z 

Now we need to extend this equality, true for only one vector nc, to some g-big set 
of vectors. We leave it for the reader to check that (|2.2[) actually gives 

q-\\m\\U Nn( - ClZ2+C2Zl) X - x\\ = 0. 

z 

for any N G Z. (Hint: Use a telescoping sum.) But the set Z • nc is again a <?-big 
subgroup (since q is idempotent), and so 

g-limg-lim||C/ ai22+a22l a; - x\\ = 0, 

a z 

which implies Qx — x. So Q is indeed an orthogonal projection, with image TL^. 
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Dimension 2. Finally, let us treat the really interesting case of a two-dimensional 
q. We use the same argument as before, only the splitting has to be adjusted a bit; 
instead of focusing on one specific subgroup (like Zc) , we shall consider all of them. 
So let 

U2 = P| ker Q S Q^ and Hi = im QsQ^ 

a, b a,b 

be the two complementary subspaces, where both the intersection and the sum are 
taken over those a, b S Z 2 for which the subgroup Za + Z6 has rank two. Again, it 
will turn out that Q is orthogonal projection onto H^- 

To prove that Q fixes every vector in H2, we may again limit our attention to 
elements x € im QsQ^ for two vectors a and 6 that span a rank two subgroup. The 
same argument as before shows that 

g-lim||C/ M(ai22+Q22l)+Ar(fclZ2+h22l) a; - x\\ = 

z 

for any M, N £ Z; the group generated by a and b is g-big (remember that it 
contains some lattice), and so we have 

g-lim||C/ ClZ2+C22l x-a;|| = 

for g-many c G Z 2 . Taking the g-limit over c then gives the result, namely that 
Qx — x. 

To finish the proof, we have to deal with an arbitrary x G H2 and show that 
Qx = 0. What we know is that QsQ^x = for any two vectors a and b with 
a two-dimensional span. This is a lot of information, since there are many such 
pairs — in fact, for any nonzero a G Z 2 , a g-big set of b has the required property. 
For suppose, to the contrary, that g-many vectors b could span only a subgroup of 
rank one together with a. As Za + Z6 is of rank one if and only if 6 is a multiple 
of a/g (here g is the greatest common divisor of the components of a), it would 
follow that Za/g was a g-big subgroup of Z 2 , contradicting our assumption on the 
dimension of g. 

In particular, we know QsQ^x — for sufficiently many a and b to conclude 
that 

Q 2 x — g-lim g-lim QsQ^x = 0; 

but now the operator Q is very evidently self-adjoint and so Qx — as well. This 
shows that Q is a projection and ends the proof of Theorem 12.31 

Conclusions. Let us end this section with several remarks concerning the nature 
of the proof. Firstly, the reader will have observed the balance — crude in the 
case of one-dimensional p, slightly more subtle for two dimensions — between the 
two spaces of the splitting. In the first space, Hi or H2, where we use the null 
spaces of projections, we need to intersect a large number of them to make up for 
the weakness of each individual piece; for each operator, we only know that one 
particular p-limit is zero, and that amounts to nothing by itself. On the other hand, 
the orthogonal complements, TL^ or TC2, involve image spaces of projections; the 
knowledge that we gain from each piece is far stronger here, and so we can afford 
to have this knowledge in only one case. 
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Secondly, it is clear that the dimension of the ultrafiltcr is important. It was 
pointed out before that, although the same concept is lurking around in Theo- 
rem 12.11 it is less easily quantified and dealt with there. The passage from fjJ- to 
(37? helps to make it visible, by removing the IP-sets. Moreover, it is of course 
unnecessary to handle the various dimensions by different arguments; the proof is 
really the same in all cases. Indeed, in Section [5j when proving the main theorem, 
the first step will be to adjust the dimension of the surrounding group to make it 
match that of the ultrafilter p. This is where Lemma |5] will play its part. 

Finally, the more general result in the main theorem requires more effort to 
prove; although the proof is, in essence, the same as the one given here, there are 
several technical points that need to be dealt with. In particular, the presence of 
polynomials of higher degree needs special care. The following two sections contain 
a few tools that will be helpful; all necessary results about polynomials are collected 
in Section [U 

3. Orthogonal projections and limits 

In this section, we prove two simple but useful results about orthogonal projec- 
tions and limits; these are well-known, of course. The first, which has already been 
used, gives a condition for an operator to be a projection. 

Lemma 9. A normal operator P on a Hilbert space TL is an orthogonal projection 
if, and only if, it satisfies P 2 = P . 

Proof. Necessity is clear. If P meets the condition, the product Q = P*P of P 
and its adjoint also does. The latter is self-adjoint in addition, hence satisfies 
(Qx, x — Qx) = (x, Q(x — Qx)) = and is therefore an orthogonal projection onto 

the image space of Q. For x <G kerQ, one has ||-P£|| 2 = (x,Qx) — 0; for x € imQ, 
one has Px — PQx — PP*Px — P*Px = x. Consequently, P — Q is an orthogonal 
projection. □ 

Our second lemma deals with the question of when certain 'weak' limits in a 
Hilbert space H are 'strong' limits and is meant to collect the pieces of reasoning 
used in the previous section. The whole discussion is somewhat vague but the 
result is useful, though nearly self-evident. Let 'lim' be an abbreviation for some 
unspecified p-limit, maybe even a multiple one, and let / be the corresponding index 
set. So for example, lim might equal p-lirmj p-limg , with both a and b ranging over 
Z 2 , in which case the index set / would be T? x Z 2 . 

By what we said in Section[T] the limit lima^ is defined, in the weak topology, for 
every bounded family (xi)i^j of points, as any closed ball in Tt is weakly compact. 
limXj = x thus means that for any y £H, 

lim(xi,y) = (x,y). 

On the other hand, the convergence is called strong if 

lim — x\\ = 0. 

The norm is weakly lower semi-continuous — if x = limxj, then 

1 1 x 1 1 < lim | \xi 1 1 . 

One can also define the notions of weak and strong operator limits; in fact, we 
have already been using these. We say, for example, that T is the weak operator 
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limit of a family (Tj)i g / of operators — and write T = limTj if 

lim(Tja;, y) = (Tx, y) (for x,y eH). 



A few simple calculations then give the following result. 

Lemma 10. Let lim and I be defined as above. 

(1) If T = limC/j is the weak operator limit of a family (£/j)i £ j of unitary 
operators, then T is normal. For x G H, one has lim||Ta; — Uix\\ = 
if, and only if, \\Tx\\ = \\x\\. In case T is an orthogonal projection, this 
happens exactly when Tx = x. 

(2) If T — limPj is the weak operator limit of a family (Pj), e / of orthogonal 
projections, then T is self-adjoint. For x G TL, one has lim||Tx — Pix\\ = 
if, and only if, (Tx, x—Tx) = 0. In case T is itself an orthogonal projection, 
this condition is always satisfied. 

The result looks innocent enough, but it will be used frequently. 



We shall be using polynomials in several variables for which the following nota- 
tion seems appropriate. Lower-case Roman letters with arrows will usually denote 
n-dimensional vectors, e.g. z = (z\, . . . , z n ). Wc shall be speaking of polynomials 
in the variable z, meaning really polynomials in the n variables z\, . . . , z n . The 
degree of such a polynomial will be its total degree. We shall also consider poly- 
nomials in several multi-dimensional variables: f(a, c), say, would be a polynomial 
in both sets of variables; the degree in a is the total degree of / as a polynomial in 
Oi, . . . , a n , and so on. 

If G is any Abelian group, we shall let G[z\, . . . , z s ] stand for the additive group 
of polynomials in zi, . . . , z s with coefficients in G; wc shall ignore the multiplicative 
structure. For the subgroup of those polynomials in Q[zi, . . . , z r ] that produce 
integer values for integer arguments, we shall write Int[zi, . . . , z s \. 

In the one-dimensional case, Int[x] consists of all polynomials / G Q[x] with 
/(Z) C Z. It is a free group with basis consisting of the polynomials 



Indeed, if / is any polynomial in Int[x] and m its degree, one may find m+ 1 integers 
a , . . . , a m such that 



by evaluating successively at x = 0,1, ... ,m, and solving the resulting system of 
equations. For any number d > 0, the polynomials of degree at most d form a free 
subgroup of rank d+1. 

The same argument, applied inductively, proves the following. 

Lemma 11. Int[ii, . . . , z s ] is always a free group; for any integer d > 0, the 
polynomials of total degree at most d constitute a free subgroup of finite rank, and 
so do the polynomials of degree at most d in each variable. 



4. Polynomials 




(for i > 0). 
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We now introduce one more useful notion. In the example in Section [2| when 
dealing with the polynomial f(x,y) = xy, we found it useful to form the new 
polynomial 

f(a + x,b + y)- f(a, b) - f(x, y), 

essentially because its degree in (x,y) was lower. An appropriate generalization is 
as follows. Given a polynomial f(z) S G[z] and an integer s > 1, we recursively 
define a new polynomial A s f(z\, . . . , z s ), by letting A 1 /(z*i) = f(z\), and 

(4.1) A s+1 f(z 1 ,...,z s+1 ) = 

A s f(z u ...,z s + z s+l ) - A7(zi, ...,z s )- A s f(z 1} . . . , z s+l ). 
Of course, A s can be described explicitly as 

#m, ...,z s )= J2 (- 1 ) sHq| • f(E *)> 

0/aC{l,...,s) iea 

and the symmetry in all arguments is more apparent from this description. 

Let us investigate some properties of A 5 . First, we have the following easy lemma. 

Lemma 12. The polynomial A 2 /(a, z) — f(a + z) — /(a) — /(•?) is of lower degree 
in each variable than /(-?) itself, whenever f(z) G G[z] is nonzero. 

Now let f(z ) be of degree d > 1. Since A s /(z*i, . . . , z s ) is symmetric in its s argu- 
ments, the lemma — together with the relations (|4.1|) — immediately shows that its 
degree in any variable can be at most (d+l — s). It follows that A d+1 /(^i, . . . , Zd+x) 
is a constant, with value 

A d+1 /(0, . . . , 0) - J2(-l) d+1 - k ^ + k X ) /(0) = (-l) d /(0). 

If / happens to satisfy /(0) = 0, one has A d+1 /(zi, . . . , Zd+i) = 0. For reasons 
of symmetry, A d f(z\, . . . , za) is then linear in each of its d arguments. 
We have shown the following. 

Lemma 13. For any polynomial f(z) G G[z] of degree d > 1, one has the relation 

A d+1 f(z 1: ...,z d+1 ) = (-lff(0). 

If /(0) = 0, then A f(z\, . . . , zj) is a linear function of each argument. 

A third lemma deals with the case of homogeneous /. 

Lemma 14. Let f(z) E G[z] be a homogeneous polynomial of degree d > 1. Then 

A d f(a,...,a) = d\f(a). 

Proof. Using homogeneity, we have 

A°f(a, ...,a) = ^(-ir fe Q f(ka) = ^(-l)- fc Q k d ■ f(a). 

We obviously have to evaluate sums of the form 

C(s,m) = Y J (-l) s ' k (f)k m (for s > l,m> 0); 

k=l ^ ' 
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in particular, C(d,d) = d\ is what we need to show. From the previous lemma, we 
already know that C'(s,m) = whenever s > m. Now we compute 

m+i / i i \ 

C(m + 1, m + 1) = J2 {-l) m+1 - k [I ) k m+1 
fc=i ^ ' 

m+l 



(_!)"•-(*-!) ( m+ l) 



fc=l 



(™+i)EHrf c + 



(m + 1) ^(-ir-' (7) + ( m + !) E ( " ) C '< '"• ' 1 



1=0 v 7 i=l 

= (1 - l) m + (m + l)C(m, m) = (m + l)C(m, m), 
and together with (7(1, 1) = 1 this proves the lemma by induction. □ 

We will now use the previous results to establish an important technical lemma; it 
is essential for the proof of the main theorem in Section [Sj Note that it introduces 
a feature not present in the example of Section [21 where we had to deal with 
polynomials of no more than first degree. It does, however, fit in with the general 
philosophy behind the argument — there is one situation in the proof where one has 
to make a lot from apparently nothing, meaning where one has to create useful 
p-big sets from useless ones, and the following lemma does just that. 

Lemma 15. Suppose that p G (ill 1 is an n- dimensional idempotent. Let G be an 
Abelian group and let v(z) £ G[z] be a polynomial in z = (z%, . . . , z n ). Fix a 
subgroup V C G. If the set 

B = {b eZ n \N ■ v(b) e V for some N ^ 0} 

is p-big, then so is the set 

A = {a eZ n \v(a)-v{0) eV}. 

Proof. The idea of the proof is simple: Whenever B contains IP(&i, . . . , b s ) , there is 
some integer N ^ such that N ■ A s v(bi, . . . , b s ) G V. Using this and the previous 
results, we can extract from v its homogeneous parts of different degrees, and show 
that they are each contained in V for p-many z. Proceeding stepwise, we shall 
prove two things: 

I. Without loss of generality, it may be assumed that v(0) = 0. 
II. The set A = {a \ v(a) G V) is p-big. 

The details are as follows. 

I. Let d be the degree of v(z). The set B is a member of the idempotent p, and 
by Lemma [TJ we may select d + 1 elements b\, . . . , bd+i in B with 

F = lP(b 1 ,...,b d+1 ) CB. 
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For each b in this finite IP-set, there exists some N ^ such that N ■ v(b) G V; 
if we let Ni be the greatest common divisor of these numbers, we guarantee that 
Ni ■ v(b) G V for every b G F. By Lemma IT^l we now have 

Nx ■ v(Q) = Nx ■ (-l) d A d+1 v(h, • • • , b d+1 ) G V. 

If b is any element of B and N ■ v(b) G V, the expression iVj, iV • (v(b) — v(0)) is also 
an element of V; this means that the set 

{b\N ■ (v(b) - v(0)) G V for some N ^ 0} 

is equally p-big. We may therefore replace v(z) by v(z) —v(0) and assume u(0) = 0. 

II. We decompose v into homogenous polynomials, 

d 

v(z) = ^2hi(z), 

i=l 

say, with hi(z) homogeneous of degree i. For 1 < i < d, let 

= {a | /ij(a) G V}. 

We shall argue that G p; once this is known, the same reasoning applies to the 
polynomial v(z) — hd(z) where it gives A^-i G p, and so on, until one has Ai G p 
for each i. The result follows because A contains the intersection of all A^. 

To show that A d is p-big, we use a similar — but more careful — approach as before. 
By Lemma [I] there are actually p-many b\, . . . ,bd G B with IP(&i, . . . , bd) Q B, 
and again, to each choice we may find some N\ ^ such that N\ ■ A d v(bi, . . . , bd) 
is an element of V. When selecting only 6i, . . . , bd-i from these, the set 

B d = {b | N ■ A d v(b 1: bd-ub) G V for some N ^ 0} 

is then always p-big by construction. 

Bd is also a subgroup of Z™, the polynomial A d v(z±, . . . , z<j) being linear in each 
variable (see Lemma II 3|) . Since p is n-dimensional, this subgroup has to have rank 
n and has to contain a set of the form L ■ Z ra for some nonzero L. For each b G Z n , 
one gets 

NL ■ A d v(h, . . . , b d -i,b) = N ■ A d v(h, b d -i,L ■ b) G V 

for some N 0, and so we conclude that Bd = U l . 
Next, consider the set 

B d -i = {b | for some N ^ 0, N ■ A d v(b 1 , b d - 2 , b, a d ) G V for all a d }; 

by the above, it is p-big, and a repetition of the argument shows that Bd-i = Z", 
too. Continuing in this way, we eventually find a nonzero integer N 2 such that 

N 2 -A d v(ai,...,a d )GV 

for any choice of ax, ... , G Z ra . 
By Lemma [T!21 we have 

A d h d (a 1 , ...,3 d ) = A d v(a 1 , a d ), 

because all terms of degree less than d disappear. Finally, using Lemma [HI we get 

N 2 dl ■ h d (d) = N 2 ■ A d v{d, . . . , a) G V 
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for all a, and hence hd(a) G V whenever a G N%d\ ■ 7L n . The latter set is p-big and 
so Ad G p. This ends the proof of the second part, and establishes the lemma. □ 

5. Statement and proof of the main results 

After all the preliminary work in the previous two sections, we are now ready 
to state and prove the main result. The notation is somewhat heavy, but this 
generality is needed because of the inductive nature of the proof. 

Theorem 5.1. For j = 1, . . . , s, letpj G [3Z nj be an idempotent, and let U\, . . . , U m 
be commuting unitary operators on a Hilbert space H. Given any m polynomials 
fi, ■ ■ ■ , f m G Int[zi, . . . , z s ] — with Zj of dimension nj — satisfying /i(0) = for all 
i = 1, . . . , m, define an operator P on TL by 



m 




Then P is always an orthogonal projection. Any two operators defined in this way 
commute. 

Proof. We will suppose that all nj are equal to some n and all pj are equal to some 
p, in order to simplify notation. The argument need not be changed in any way 
to accommodate the more general situation — because of the inductive character of 
the proof, we find ourselves working on no more than one p-limit at a time anyway. 

Let us first observe that the last part of the statement — commutativity of differ- 
ent projections — is obviously true, for all operators generated for various selections 
of polynomials are certain weak limits of commuting unitary operators. We may 
therefore assume commutativity wherever needed. 

The remainder of the proof is essentially by induction on the number s of p-limits 
taken, but there are some complications involving the case s = 1. In fact, different 
arguments are needed for s = 1 and for s > 2, since the outmost p-limit is one of 
unitary operators in the former situation, but one of projections in the latter. To 
make the induction be more transparent, we shall use the abbreviation (s,d) when 
referring to the statement of the theorem for a certain value of s and all possible 
choices of polynomials fi of degree at most d in any of their variables Zj . 

The proof will be divided into several steps, the second and third inductive in 
nature: 

I. We argue that the ultrafilter p may be assumed to be n-dimensional, with- 
out loss of generality. 
II. We establish the case (1, d), that is, we show that the operator 

m 

P = p-WmHuf^ 7 

with fi of degree at most d, is an orthogonal projection, assuming the state- 
ment of the theorem in the two cases (1, d — 1) and (2, d — 1). Specifically, 
we need to assume that the operators 

m 

Q s =p-\™T\ut [s+3) - u{s) - h{s) 

and 

Q = P-Iim Q s 

a 



IDEMPOTENT ULTRAFILTERS AND POLYNOMIAL RECURRENCE 



19 



are orthogonal projections. 

III. For s > 2, we derive (s, d) from (s— 1, d) and (1, d). Essentially, we introduce 
the new polynomials 

fl(a, z 2 ,..., z s ) = fi(a, z 2 , . . . , z s ) - fi(a, 0, . . . , 0) 

and assume that for each a eZ™, the operator 

m 

P s = p-lim • • -p-hmTT up aA '-" A) 

l—l 

is an orthogonal projection. We then construct a suitable splitting of the 
underlying Hilbert space TL. 

IV. Using the splitting introduced in the previous part, we show that P is an 
orthogonal projection. 

Once we have established all of the previous, our work will be done. For the 
statement of the theorem is definitely true in the case (1, 0) — if all fi equal zero, 
P is just the identity — and then |TT] and IIIII suffice to prove the entire theorem by 
induction. Let us now take a detailed look at the four steps. 

I. Using Lemma [6j we begin by adjusting the situation to make sure that the 
dimension of p is equal to the rank of the group. Of course, this will change the 
polynomials under consideration; but since their degrees are not increased, it does 
not affect the proof. To write this down precisely is somewhat cumbersome; so 
let us look at the case of just one operator U and one polynomial f(z) to see 
what happens. We shall use z for an n-dimensional and w for an s-dimensional 
variable. If s = dimp and <p are as in Lemma |6l then q = </>*(p) is idempotent and 
s-dimensional. By Lemma El 

(5.1) P -\imU i(S) =q-limU f( ' rl( -* 1)) . 

z w 

But g(w) = f{4>^ 1 (w)) S Int[w] satisfies g(0) — 0, and is of degree no larger than 
that of /; thus a proof that the right-hand operator in (15.11) is a projection gives 
the result for the left-hand one, too. The same is true in the general setting of the 
theorem, though somewhat unpleasant to write down in detail. 

In any case, we shall assume from now on that dimp = n. Lemma 1151 is then 
applicable; it will make its entry in the third step of the proof. 

II. As stated above, we shall now assume that both Q and all the Qg are orthogonal 
projections; this is permissible because each polynomial 

Md + z) - Md) - Mz) 

has degree at most (d — 1) in a and z (see Lemma |12[) . The projection Q induces 
a splitting TL = kerQ © imQ of the underlying Hilbert space; we shall use it to 
conclude that P 2 x = Px for all x G TL. 

First, consider x 6 kerQ. Since Qx — p-limg Qax, Lemma 1101 implies that the 
convergence is strong, 

p-limllQsxIl = 0. 

a 
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We now use van der Corput's trick to get Px — 0, the condition 

m m 

p-h>p-lim(jj uf l(S+Z) x, J] U^ S) x) 

a ' i=l i=l 

m rn 

=p-Kmp-lim/TT uM*+*)-M*i-M) x , TT 

i=l i=l 
m 

= p-\im(Q s x, J] U; h{S) x) = 
" i=l 

in Lemma [4] being fulfilled. A fortiori, P 2 x = Px. 

Second, consider an arbitrary s G imQ, which then satisfies Qx = x. We again 
get strong convergence from Lemma [TUl so that 



p-limp-limllTT Uf' {S+S) x - TT uf i{S)+MZ) x 

'i 

But then 

P 2 x = p-limp-limTT u{ i{S)+MS) x = p-hmp-limTT uf' {S+?) x 

2=1 2=1 

m 

= p-limTT[// ,(2 "' ) x = Px. 

i=l 

We now have P 2 = P; obviously, P is normal, and the result — that P is an orthog- 
onal projection — follows from Lemma [9l 

III. This is the most interesting part of the argument. We start from the inductive 
assumption that each Pg is an orthogonal projection, and aim for a useful splitting 
of the space TL, depending on the projections P& and the polynomials 

f-(a,z 2 , ...,z s ) = f t (a,z 2 , . . . ,z s ) - /i(a,0, ... ,0). 

Note that fl(a, 0, . . . , 0) = for any a, which means that the new polynomials f[ 
still satisfy the conditions of the theorem, while having fewer variables. 

After the introduction of gi{a) = fiifl, 0, . . . , 0), the operator P is then given by 
the limit 

m 

P = p-\im(j[ut {S) )Pa- 

° i=l 

We shall let F C Int[z*2, . . . , z s ] denote the set of polynomials of degree at most 
d in each variable; F is a free group of finite rank by Lemma 1111 The product F m 
is also free, as are all of its subgroups, and for any a G Z n , the vector 

v (a) = z 2 ,...,z s ),...,f m (a, z 2 ,...,z s )) 

is an element of F m . We introduce the notation V(ai, . . . , a r ) for the subgroup of 
pm g enera t ec j by the vectors v{a\), . . . , v{a r ). 

The crucial idea is to let r > be the maximal integer for which 

{ai | ■ • ■ {a r | V(ai, . . . , a r ) has rank r} G p- • •} Gp. 

Such an r has to exist, because we are working inside a fixed group of finite rank; 
if not even {ai | V{a\) has rank one} is in p, we set r — to keep the notation 



= 0. 
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consistent. Whenever ai, . . . ,a r are taken, in the correct order, from these nested 
sets, the rank of the group V(a*i, . . . , a r ) is r. 

With r being defined in that manner, one also has 

{eti | • • • {a r +i | V{ai, . . . , a r +i) has rank less than r} € p- ■ ■} G p. 

Intersecting with the previous set and using that p is an ultrafilter, we obtain 

{ai | • • • {a r | V(a\, . . . , a r ) has rank r and 

{b | V(ai, . . . , a r , b) also has rank r} 6 p} G p • • •} G p. 

But if V(di, . . . , a r ) and V(ai, . . . , a r , 6) both have rank r, it means that some 
nonzero multiple of b has to lie in the first group. We can therefore conclude from 
the previous line that 

{a*i | • • • |a r | V = V(ai, . . . , a r ) has rank r and 

{6 | N ■ v(b) G V for some JV 7^ 0} G p} G p • • •} G p. 



(5.2) 



Finally, let A denote the set of r-tuples (ai, . . . , a r ), taken in the right order 
from the nested sets in (|5.2|l ; for any one of them, the group V = V(ai, . . . , a r ) has 
rank r and the set {b | • G V for some N ^ 0} is p-big. 

We have now arrived at our destination — we shall use the splitting H — HitBHi, 
where 

Hi = f|kerP Sl • • • P Sr and Hi = ]T imP 3l • • • P Sr . 

A A 

This ends the third step; the proof, based on this splitting, that P is an orthogonal 
projection is contained in the remaining part of the proof. 

IV. It remains to prove that the operator P really is a projection. Because of 
the splitting from IIII1 we have two subspaces to consider. Let us begin with the 
one that is easier to handle, and show that P is zero on H\. If x G Hi, we have 
Pa x • • ■ Ps r x — for p-many a\ , . . . , a r , thus 

m 

P r x = p-lim- • -p-limfTT Uf p 3i . . . p~x = 0. 

1=1 

As P is self-adjoint, P r x — quickly leads to Px = 0. 

The complementary subspace H±, on the other hand, requires more attention. 
Here, we shall show that P is equal to another projection P', to be defined below, 
and constructed with the help of the inductive assumptions. So suppose that x G 
imPjj • • • Ps r for a certain tuple (01, . . . , a r ) G A; we shall reason that Px = P'x, 
which, by the usual density argument, is sufficient for equality on all of H^ . 

Since x lies in the image of the product Pg t ■ • ■ Ps r , we get Ps k x = x for each 
k = 1, . . . , r. Apply Lemma [TUl to get strong convergence, in the form 



p-lim - ■ •p-limllrr u( i{Sk ' 22 '- A) x - 

1=1 

One easily derives that for any r integers Ni, . . . , N r , 



0. 



p-lim - • - p-lim TT K 



N 1 f' z {a 1 ,z 2 ,...S s ) + ---+N r f' i (3 r ,z 2 ,...,z s ) ^ 



i=l 



= 0. 
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The vectors v(dk) span the group V — V(ai, . . . , a r ), and so we can conclude that 
the equality 



(5.3) p-lim ■ • -p-lim TT U, 



hi(z 2 , ■■■,£„) 

'x — x 



i=l 



is true for any element (hi, . . . , h m ) G V. 
We already saw, after (|5.2[) . that the set 



{b | N ■ v(b) G V for some N ^ 0}; 

is p-big, because of how the integer r was chosen. As a consequence of Lemma IT5| 
which applies because dimp = n, the set 

{c\v{c)-v{0) e ^} 

is now also p-big. Together with (|5.3p . this gives 



T fl(c,z 2 ,...,Z s )-f!(o,z 2 ,...,Z s ) x _ 



= 



x 



0. 



p-lim • • • p-lim TT t/, 

Z2 z s 1 1 - 1 - 1 - 

? 

for p-many c G Z n , hence 

m m 

p-hm p-lim ■ • •p-lim||TTf//* (c ^'---' &) a ; - TT 

c 5*2 5* s 1 1 A A 

i=l i=l 

The following computation now ends the proof of the fourth and last step: 

m 

Px = p-lim (JJ Uf^^PsX = 

C i=l 

m m 

= p-lim(jj uf S) ) p-lim • • ■ p-lim JJ uf i(S '* a '- A) x 



TYl 111 

(p-lim JJ C/f l(5 * } ) (p-lim • • • p-lim JJ 



.r 



i=i 

i(0,Z 2 ,...,Z 3 )" 

i=l ~° i=l 

By the hypotheses (1, d) and (s— 1, d), both bracketed expressions define orthogonal 
projections, which moreover commute with each other. Let P' be the projection 
operator defined as their product; then we have shown that Px — P'x, for all x G 
imP^ • • • Pg r . This proves that P is also an orthogonal projection when restricted 
to "Hi, and thus completes the proof. □ 

Comparison with the original. Since the main purpose of this paper is to re- 
prove the result of [T] using ultrafilters, it may be worthwhile to compare the proof 
there with the one just given. The overall argument is the same — a proof by in- 
duction, relying on a splitting defined in terms of certain groups, which depend on 
the data (the given IP-sets in pQ, the idempotent ultrafilter here). To make the 
inductive step work out, we have to allow for multiple p-limits, necessitating a more 
intricate argument; our proof is consequently slightly longer than the original one. 

We should point out that the third step of the proof uses the same splitting as 
that in pQ. The notion of dimension, mentioned before, is also apparent in the 
original paper, and has to be dealt with in much the same way. While we use 
Lemma [5] for this purpose, Bergelson, Furstenberg, and McCutcheon rely on the 
Millikcn- Taylor theorem to handle the different possible dimensions in a unified 
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manner. Several other auxiliary results, proved or quoted in the other paper, also 
occur at some point in our proof. 

Lastly, IP-limits have been replaced by limits along ultrafilters, which means 
that no subsequences (or more strictly sub-IP-rings) have to be chosen to get con- 
vergence. This adds much convenience to the argument. 

An IP- version. As in the example in Section [21 we can derive from the previous 
theorem a version with IP-sets; because of the many subscripts and superscripts, it 
is more complicated to write down. 

Theorem 5.2. For j = 1, . . . , s, let qj € f3J- be an uncongested idempotent, and let 
Ui, . . . , U m be commuting unitary operators on a Hilbert space Ti.. Given m poly- 
nomials ft, . . ., f m G Int[zi, . . . , z s ] — with Zj of dimension nj — satisfying fi(0) = 
for all i = 1, . . . , m, and given additionally IP-sets w % '^ , indexed by j — 1, . . . , s 
and k — 1, . . . ,rij, define an operator P on H. by 

n Mw- ,...,w~ ,...,w^ ,...,n- ) 
[/; Q1 Q1 Qs ° s 

Ml f i 

i=l 

Then P is always an orthogonal projection. Any two of the operators defined in this 
way commute. 

Proof. We use Lemma[2]and define the following maps. For each j = 1, . .. , s, let 

<fc:.F-Z% <t> j {a) = {wy,...,w n jj), 
and introduce new ultrafilters 

Since the original ultrafilters were uncongested, all pj are idempotents by virtue of 
Lemma El and we obtain 

m 

P = pi-lim- • ■p s -\im\\U f l dSl, -' Za) 

i— 1 

from Lemma [5J The result now follows from the previous theorem. □ 

Now Lemma [7] states that any IP-ring is contained in an uncongested idempo- 
tent of (3T; it follows that the conclusion of Theorem 15.21 holds equally well after 
replacing each ultrafilter limit by a limit over some IP-ring. We thus recover the 
main theorem of the original paper pQ, as we had set out to do. 

6. Consequences 

From the two theorems in the previous section, we can now derive several other 
results. In order to simplify the statements, we shall only consider single p-limits. 
Let us begin by showing why it is useful that the weak operator limits we considered 
are orthogonal projections. 

Theorem 6.1. Let (X,B,^i) be a probability measure space, and let A C X be a 
measurable set. Let Ti,.. . ,T m be commuting invertible measure-preserving trans- 
formations on X. Furthermore, assume that polynomials /i,...,/ m € Int[z] are 
given, where z — (z\, . . . , z n ), such that /i(0) = for alii = 1, . . . , m. 
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(1) For any idempotent p <E /3Z n , one has 



p-limJAn (n^ /,(?) ) ''A) > fi(Af 

(2) For any uncongested idempotent q 6 (3J- and for any collection of IP-sets 
wi, with j = 1, . . . ,n, one has 



\2 



/ m — 1 \ 

lim/xfAnd]^^'-'^) A )> M (A) 



?- 

Proof. We shall only prove the first statement; the argument will likely be familiar 
to the reader anyway. On the Hilbert space TL = L 2 (X, pi) , introduce m commuting 
unitary operators U±,..., U rn , defining Ui by the rule Uig = g o Tj for g 6 H. By 
virtue of Theorem 15. 1[ the operator 



P = 5>limJJf// 



i=l 

is an orthogonal projection onto some closed subspace of TL. Write g for the char- 
acteristic function of the set A, and introduce the abbreviation 

m 



T = l[T t i 

Then we have 

p-\imfx(A n T~ 1 A) = p-Wmlg, J[ uf i(g) g) = (g, Pg). 



i=l 

Now P is a projection; therefore, if e = 1 denotes the function identically equal to 
1 , of norm 1 1 e 1 1 = 1 , 

(g,Pg) = \\Pg\\ 2 = ||Pg|| 2 ||e|| 2 > (Pg,e) 2 = ( 9l Pe) 2 . 

Finally, Pe = e, since e is invariant under the action of the unitary operators Ui, 
and so 

(g,Pef = {g,e) 2 = fi(A) 2 . 
Combining the three displayed (in)equalities gives the desired result. □ 

The consequences of the preceding theorem are twofold. First, when applied 
to the case of a single measure-preserving transformation, the two inequalities in 
Theorem 16.11 show precisely that the sets 

{f(z)\zGZ n } 

and 

{/«,...,<) \aeT} 
are sets of nice recurrence; here / may be any polynomial in Int[z] satisfying /(0) = 
0, and wi, with j = 1, . . . , n, can be arbitrary IP-sets. 

Secondly, we can exploit the fact that the idempotent ultrafilters in Theorem l6.ll 
may be chosen arbitrarily. Under the assumptions made above, for any e > 0, the 
set 

R e = {z e Z" | fi(A n T^A) > ^{Af - e) 
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has to be contained in every idempotent ultrafilter in (31 n . The reader will re- 
member that this is equivalent to saying that R e is IP*, that is, intersects every 
IP-set of Z". In particular, every R e is a syndetic set, because the IP* property 
implies syndeticity. A special case of this result is Khintchine's recurrence theorem, 
which states that for a single measure-preserving transformation T, the sets of nice 
returns 

{n eN\n(An T- X A) > ^i(A) 2 - e} 
are syndetic. Several other applications may be found in the original paper pQ. 
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